REVERSE MATHEMATICS AND
A RAMSEY-TYPE KÖNIG'S LEMMA



STEPHEN FLOOD 

Abstract. In this paper, we propose a weak regularity principle which is
similar to both weak König's lemma and Ramsey's theorem. We begin by
studying the computational strength of this principle in the context of reverse
mathematics. We then analyze different ways of generalizing this principle.



1. Introduction 

Weak König's lemma states that for any infinite binary tree $T \subseteq 2^{<\mathbb{N}}$,
there is an infinite path $p$ through $T$. When formalized in second order
arithmetic, this theorem is denoted $\mathsf{WKL}_0$. There is a direct
correspondence between a path $p$ through a binary tree, which is a function
$p : \mathbb{N} \to \{0, 1\}$, and the set $\{x : p(x) = 1\}$. With this in mind,
we occasionally identify paths through trees, colorings of singletons,
and subsets of $\mathbb{N}$.

Ramsey's theorem is a generalization of the pigeon-hole principle. Given
$n \in \mathbb{N}$, let $[\mathbb{N}]^n$ denote the collection of size $n$ subsets
of $\mathbb{N}$. The infinite version of Ramsey's theorem says that for every
$n, k \in \mathbb{N}$ and every function (called a coloring)
$f : [\mathbb{N}]^n \to \{0, \dots, k-1\}$, there is an infinite set
$H \subseteq \mathbb{N}$ which is given one color by $f$. In the context of second
order arithmetic, this principle is denoted $\mathsf{RT}^n_k$ (with $n, k$
as above).

Weak König's lemma and Ramsey's theorem can be thought of as asserting
the existence of different types of regularity. Viewed topologically, weak König's
lemma is essentially the statement "$2^{\mathbb{N}}$ is compact." This carries over
to reverse mathematics, where $\mathsf{WKL}_0$ is equivalent to many theorems about
compactness (see [13]). For its part, Ramsey's theorem says that no matter how
badly behaved a coloring is, it always has a sizable homogeneous set. In the words
of T.S. Motzkin, absolute disorder is impossible.

In this paper, we study the computational and reverse mathematical strength of
a regularity principle which combines features of weak König's lemma and Ramsey's
theorem. We will refer to the statement "for each infinite binary tree $T$, there is
an infinite set $H$ homogeneous for a path through $T$" as a Ramsey-type König's
lemma. In statement 2 we give a formal definition of our Ramsey-type König's
lemma, denoted RKL, in terms of finite strings. It is the novelty of this principle,
rather than the complexity of the proofs, that is the main innovation of this paper.

We begin by showing that RKL is a nontrivial weakening of $\mathsf{WKL}_0$ and of
$\mathsf{RT}^2_2$. More formally, we show that $\mathsf{SRT}^2_2$ implies RKL, that
$\mathsf{WKL}_0$ implies RKL, and that RKL implies DNR (unless otherwise specified,
we always work over $\mathsf{RCA}_0$). Applying results of [1] and [11], we conclude
that RKL is strictly weaker than $\mathsf{WKL}_0$ and $\mathsf{SRT}^2_2$.

In the remaining sections, we state analogs of RKL for trees generated by infinite
sets of strings ($\mathsf{RKL}^{(1)}$) and for arithmetically-definable trees
($\mathsf{RKL}^{(<\omega)}$). We then study the strength of each principle, and
obtain the surprising result that these stronger principles are more closely
related to $\mathsf{RT}^2_2$ than to $\mathsf{WKL}_0$.

We show that $\mathsf{RT}^2_2$ implies $\mathsf{RKL}^{(1)}$ and that
$\mathsf{RKL}^{(1)}$ implies $\mathsf{SRT}^2_2$. By the main results of [S] and
[11], it follows that $\mathsf{RKL}^{(1)}$ and $\mathsf{WKL}_0$ are incomparable.
We also show that $\mathsf{RT}^2_2$ does not imply $\mathsf{RKL}^{(<\omega)}$ and,
by using a result of [11], we show that $\mathsf{RKL}^{(<\omega)}$ does not imply
$\mathsf{WKL}_0$. We leave open whether $\mathsf{RKL}^{(<\omega)}$ implies
$\mathsf{RT}^2_2$. We summarize our results in figure 1.

1.1. Working in second order arithmetic. We assume that the reader is
familiar with the basic definitions and results of computability theory and reverse
mathematics. For an introduction to reverse mathematics, see chapter I of [13].
For an introduction to computability theory, see part A of [14].

Some care is required to formalize RKL in $\mathsf{RCA}_0$. Our goal here is to
study the computational complexity of the homogeneous set $H$, not of the path $p$.
While there are computable trees $T$ such that each path through $T$ is reasonably
complicated, there are paths $p$ with relatively simple homogeneous sets $H$. We
begin with definitions that allow us to say that $H$ is homogeneous for some path
through $T$ without explicit reference to a path $p$.

Definition 1 ($\mathsf{RCA}_0$). $H$ is homogeneous for $\sigma \in 2^{<\mathbb{N}}$
with color $c \in \{0,1\}$ if $\sigma(x) = c$ for each $x \in H$ s.t.
$x < |\sigma|$. $H$ is homogeneous for a path through $T$ if
$\exists c \in \{0,1\}$ s.t. $H$ is homogeneous for $\sigma$ with color $c$ for
arbitrarily long $\sigma \in T$.

Statement 2 ($\mathsf{RCA}_0$). RKL asserts that "for each infinite binary tree $T$,
there is an infinite set $H$ which is homogeneous for a path through $T$."
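
To make definition 1 concrete, the following Python sketch checks homogeneity
against a finite fragment of a tree. The function names, the sample tree, and the
sample set are illustrative assumptions of ours; a finite fragment can only
suggest, not verify, the "arbitrarily long" clause of the definition.

```python
from typing import Iterable, Sequence, Set


def is_homogeneous_for_string(H: Set[int], sigma: Sequence[int], c: int) -> bool:
    """True if sigma(x) == c for every x in H with x < |sigma|."""
    return all(sigma[x] == c for x in H if x < len(sigma))


def longest_homogeneous_length(H: Set[int],
                               tree: Iterable[Sequence[int]],
                               c: int) -> int:
    """Length of the longest listed string for which H is homogeneous
    with color c, or -1 if there is none."""
    return max((len(s) for s in tree if is_homogeneous_for_string(H, s, c)),
               default=-1)


# Hypothetical finite fragment of a binary tree; strings are tuples of bits.
tree = [(), (0,), (1,), (0, 1), (1, 1), (0, 1, 1), (1, 1, 0)]
H = {1, 2}

print(longest_homogeneous_length(H, tree, 1))
print(longest_homogeneous_length(H, tree, 0))
```

In this fragment the set {1, 2} is homogeneous with color 1 for the longest
listed string (0, 1, 1), while with color 0 it is homogeneous only for strings
too short to mention any element of the set.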

Unless stated otherwise, all strings and trees we consider will be binary
($\{0,1\}$-valued). Given $\tau, \sigma \in 2^{<\mathbb{N}}$ we write
$\tau \preceq \sigma$ if $\tau$ is an initial segment of $\sigma$. We write
$\sigma \upharpoonright t$ (or $p \upharpoonright t$) to denote the initial segment
of $\sigma \in 2^{<\mathbb{N}}$ (or $p \in 2^{\mathbb{N}}$) of length $t$.

2. Reverse mathematics of RKL 

Theorem 3 ($\mathsf{RCA}_0$). $\mathsf{WKL}_0$ implies RKL.

Proof. Given an infinite binary tree $T$, let $p$ be an infinite path through $T$.
Note that $p : \mathbb{N} \to \{0,1\}$ maps singletons into two colors. Applying
$\mathsf{RT}^1_2$, which is provable in $\mathsf{RCA}_0$, yields an infinite set $H$
which is homogeneous for $p$. In particular, $p \upharpoonright t \in T$ and $H$ is
homogeneous for $p \upharpoonright t$ for each $t \in \mathbb{N}$. Thus $H$
satisfies definition 1, as desired. □
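
As a finite illustration of the pigeon-hole step in this proof, the following
Python sketch takes a finite prefix of a path $p$ and returns a most common color
together with the positions that receive it. The name `majority_color_class` and
the sample prefix are illustrative assumptions of ours; the proof itself applies
$\mathsf{RT}^1_2$ to the whole infinite path, which no finite computation captures.

```python
from typing import List, Sequence, Tuple


def majority_color_class(prefix: Sequence[int]) -> Tuple[int, List[int]]:
    """Return (c, H) where c is a most common bit of `prefix` and
    H lists the positions x with prefix[x] == c."""
    zeros = [x for x, bit in enumerate(prefix) if bit == 0]
    ones = [x for x, bit in enumerate(prefix) if bit == 1]
    # Ties are broken in favor of color 1; the choice is arbitrary.
    return (1, ones) if len(ones) >= len(zeros) else (0, zeros)


# Hypothetical prefix of length 5 of a path through some tree.
prefix = (0, 1, 1, 0, 1)
color, H = majority_color_class(prefix)

# Every position in H receives the same color, so H is homogeneous
# for the prefix in the sense of definition 1.
print(color, H)
```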

Definition 4 ($\mathsf{RCA}_0$). A coloring $f : [\mathbb{N}]^2 \to \{0,1\}$ is
stable if for each $x$, there is some $t > x$ s.t.
$(\forall y > t)[f(x,y) = f(x,t)]$. $\mathsf{SRT}^2_2$ is the theorem "every stable
coloring of pairs into two colors has an infinite homogeneous set."

Theorem 5 ($\mathsf{RCA}_0$). $\mathsf{SRT}^2_2$ implies RKL.

Proof. Given an infinite tree $T$, we define a coloring
$f : [\mathbb{N}]^2 \to \{0,1\}$ as follows. For each $y$, let $\sigma_y$ be the
lexicographically least element of $T$ of length $y$. For each $x < y$, define
$f(x,y) = \sigma_y(x)$.

We now show that $f$ is a stable coloring. Fix $x$, and let $T^{\mathrm{ext}}$
denote the strings in $T$ that are extended by arbitrarily long strings in $T$.
For each $\tau \in T \setminus T^{\mathrm{ext}}$ of length $x+1$, there is a bound
on the length of strings in $T$ extending $\tau$, so there is a least such bound
$s_\tau$. Note that $s_\tau$ is definable (with parameters) from $\tau$. By
induction, there is a uniform upper bound $t$ for
$\{s_\tau : \tau \in 2^{x+1} \wedge \tau \in T \setminus T^{\mathrm{ext}}\}$.
By $\Pi^0_1$ induction, there is a lexicographically least element
$\tau_{x+1} \in T^{\mathrm{ext}}$ of length $x+1$. Then for each $y > t$,
$\sigma_y \upharpoonright (x+1) = \tau_{x+1}$, because no string of length $x+1$
lexicographically below $\tau_{x+1}$ has an extension in $T$ of length $y$; hence
$f(x,y) = \tau_{x+1}(x)$. In general, for each $x$,
$(\exists t)(\forall y > t)[f(x,y) = \tau_{x+1}(x)]$. In other words, $f$ is a
stable coloring.

Suppose that $H$ is homogeneous for $f$ with color $c \in \{0,1\}$. We now show
that $H$ is homogeneous for a path through $T$. Fix $t \in \mathbb{N}$. Because $H$
is an infinite set, there is an element $y \in H$ with $y > t$. By the definition
of $f$, $(\forall x < y)[\sigma_y(x) = f(x,y)]$. Because $H$ is homogeneous for $f$
with color $c$ and because $y \in H$,
$(\forall x < y)[x \in H \Rightarrow \sigma_y(x) = c]$. Then $H$ is homogeneous for
$\sigma_y \in T$ with color $c$. Since $t$ is arbitrary and $|\sigma_y| > t$, we
have satisfied definition 1. □
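
The coloring used in this proof can be sketched in Python for a toy tree. The
membership test `in_tree` below is a hypothetical example of ours (strings
beginning with 0 die out after length 3), chosen so that the value $f(0,y)$
visibly stabilizes; the brute-force search over all strings of length $y$ is only
meant for small $y$.

```python
from itertools import product
from typing import Tuple


def in_tree(sigma: Tuple[int, ...]) -> bool:
    """Hypothetical tree: strings starting with 0 are allowed only up to
    length 3, strings starting with 1 are always allowed."""
    return len(sigma) == 0 or sigma[0] == 1 or len(sigma) <= 3


def sigma_y(y: int) -> Tuple[int, ...]:
    """Lexicographically least element of the tree of length y."""
    for s in product((0, 1), repeat=y):  # product yields lexicographic order
        if in_tree(s):
            return s
    raise ValueError("the tree has no string of this length")


def f(x: int, y: int) -> int:
    """The coloring f(x, y) = sigma_y(x), defined for x < y."""
    if not x < y:
        raise ValueError("f is only defined for x < y")
    return sigma_y(y)[x]


# The color of the pair (0, y) changes once and is then constant.
print([f(0, y) for y in range(1, 7)])
```

Here the strings of length 1 below the eventual limit are abandoned once $y$
exceeds 3, which mirrors the role of the uniform bound $t$ in the stability
argument.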

Corollary 6 ($\mathsf{RCA}_0$). RKL does not imply $\mathsf{SRT}^2_2$ or $\mathsf{WKL}_0$.

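Proof. By the main result of [11], $\mathsf{SRT}^2_2$ does not imply
$\mathsf{WKL}_0$. Because $\mathsf{SRT}^2_2$ implies RKL, RKL cannot imply
$\mathsf{WKL}_0$. Similarly, RKL cannot imply $\mathsf{SRT}^2_2$ over
$\mathsf{RCA}_0$ because $\mathsf{WKL}_0$ implies RKL but does not imply
$\mathsf{SRT}^2_2$ (by Theorem 3.3 of [12] and Theorem 10.5 of [1]). □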

We conclude our analysis of RKL by showing that it is not provable in
$\mathsf{RCA}_0$, and by showing that RKL is strong enough to imply DNR. When
$T \subseteq 2^{<\mathbb{N}}$ is a tree, $[T] \subseteq 2^{\mathbb{N}}$ will be the
set of infinite paths through $T$. The following lemma follows from the proof of
lemma 2 in [10].

Lemma 7 (Jockusch, [10]). There is an infinite computable tree $T$ such that for
any $p \in [T]$ and for any $e \in \mathbb{N}$, if $|W_e| \ge e + 3$ then $W_e$ is
not homogeneous for $p$. In fact, for each $e \in \mathbb{N}$, there is a
$t \in \mathbb{N}$ s.t. if $|W_e| \ge e + 3$ then $W_e$ is not homogeneous for any
string in $T$ of length greater than $t$.

A simple corollary is that $\mathsf{RCA}_0 \nvdash$ RKL, via an $\omega$-model.
Adapting the proof of Theorem 2.3 from [8], we can obtain a slightly stronger
result. We say that a function $f$ is diagonally non-computable relative to $X$ if
$f(e) \ne \Phi^X_e(e)$ for each $e$ s.t. $\Phi^X_e(e)\downarrow$. The principle DNR
asserts that for any set parameter $X$, there is a function that is diagonally
non-computable relative to $X$.

Theorem 8 ($\mathsf{RCA}_0$). RKL implies DNR.

Proof. We work relative to a set parameter $X$. The proof of lemma 2 of [10] (lemma
7 above) works in $\mathsf{RCA}_0$. Let $T$ be the tree defined in this proof. By
RKL, there is an infinite set $H$ homogeneous for a path through $T$. Note that
there is a definable function $g : \mathbb{N} \to \mathbb{N}$ such that $W_{g(e)}$
is the least $e + 3$ elements of $H$ (in the $\mathbb{N}$ order).

We now show that $g$ is a fixed point free function. If $|W_e| < e + 3$, then
$|W_{g(e)}| \ne |W_e|$. Suppose that $|W_e| \ge e + 3$. By the above lemma, there
is some $t$ s.t. $W_e$ is not homogeneous for any $\sigma \in T$ of length greater
than $t$. Because $W_{g(e)} \subseteq H$ and because $H$ is homogeneous for a path
through $T$, $W_{g(e)}$ is homogeneous for arbitrarily long $\sigma \in T$. In
particular $W_e \ne W_{g(e)}$. In other words, $g$ is fixed point free, so it can
be used to give a $\Delta^0_1$ definition for a DNR function (formalize V.5.8 of
[14] in $\mathsf{RCA}_0$). □

Question 9. Does DNR imply RKL? 

A number of principles are known to be stronger than DNR, such as ASRAM and
$\mathsf{ASRT}^2_2$ from [7]. Proving that one of these principles does not imply
RKL would separate DNR from RKL.

3. Trees generated by sets of strings 

Definition 10. Given an infinite set of strings $\Sigma$, let $T_\Sigma$ denote the
downward closure of $\Sigma$. More formally,
$T_\Sigma = \{\tau : (\exists \sigma \in \Sigma)[\tau \preceq \sigma]\}$.
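
For a finite set of strings, the downward closure of definition 10 can be computed
directly. The following Python sketch is illustrative only; the function name and
the sample set are assumptions of ours, and strings are represented as tuples of
bits.

```python
from typing import Iterable, Set, Tuple

BitString = Tuple[int, ...]


def downward_closure(strings: Iterable[BitString]) -> Set[BitString]:
    """All initial segments (including the empty string and the string
    itself) of the given strings."""
    return {s[:k] for s in strings for k in range(len(s) + 1)}


# Hypothetical finite set of strings.
sample = {(0, 1, 1), (1,)}
print(sorted(downward_closure(sample)))
```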

Statement 11 ($\mathsf{RCA}_0$). $\mathsf{RKL}^{(1)}$ asserts that "for each
infinite set of strings $\Sigma$, there is an infinite set $H$ which is homogeneous
for a path through $T_\Sigma$."

Note that if $\Sigma$ is an infinite computable set of strings, $T_\Sigma$ is an infinite c.e. tree.
In [B], Downey and Jockusch note that each infinite H^'" -class can be generated by 
a c.e. tree. We extend this slightly, to further motivate our definition of $\mathsf{RKL}^{(1)}$.

Proposition 12. If $T$ is an infinite $\Pi^0_2$ tree, then there is an infinite
computable set of strings $\Sigma$ s.t. $[T] = [T_\Sigma]$. Furthermore, $\Sigma$
can be taken to contain exactly one string of each length.

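Proof. It suffices to consider $\Sigma^0_1$ trees. To see this, suppose that $T$ is
$\Pi^0_2$. Then there is a formula $\varphi$ which is $\Delta^0_0$ s.t.
$\tau \in T \Leftrightarrow (\forall y)(\exists z)\varphi(\tau, y, z)$. Using the
formula
$\psi(\tau, z) =_{\mathrm{def}} (\forall x, y \le |\tau|)(\exists z' < z)\varphi(\tau \upharpoonright x, y, z')$,
we can define a tree $S$ by
$\tau \in S \Leftrightarrow (\exists z)\psi(\tau, z)$. Then
$[S] = \{f : (\forall w)(\exists z)\psi(f \upharpoonright w, z)\} = \{f : (\forall x)(\forall y)(\exists z)\varphi(f \upharpoonright x, y, z)\} = [T]$,
so we may work with $S$ instead.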

Given a $\Sigma^0_1$ tree $T$, fix a computable enumeration $\{T_s\}$ of $T$. If
necessary, we computably modify the enumeration to ensure that no $\tau$ enters
$T_s$ until $s \ge |\tau|$ and that exactly one string enters $T$ at each stage. We
computably enumerate the elements of $\Sigma$ in increasing order. At stage
$s > 0$, find $\tau \in T_s \setminus T_{s-1}$, take one $\sigma \succeq \tau$ with
$|\sigma| = s$ (the specific choice is not important), and put $\sigma$ into
$\Sigma$. It is not difficult to show that $T \subseteq T_\Sigma$ and that
$T_\Sigma^{\mathrm{ext}} \subseteq T^{\mathrm{ext}}$. It follows that
$[T] = [T_\Sigma]$. □
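
The stage-by-stage construction of $\Sigma$ in this proof can be sketched in
Python on a finite initial part of an enumeration. The function names, the choice
to pad with zeros (the proof allows any extension of the right length), and the
sample enumeration are illustrative assumptions of ours.

```python
from typing import List, Sequence, Set, Tuple

BitString = Tuple[int, ...]


def build_sigma(enumeration: Sequence[BitString]) -> List[BitString]:
    """enumeration[s - 1] is the unique string entering the tree at stage s
    (s = 1, 2, ...), and is assumed to have length at most s."""
    sigma_set = []
    for s, tau in enumerate(enumeration, start=1):
        if len(tau) > s:
            raise ValueError("a string entered before its length allows")
        # Extend tau to length s; padding with zeros is an arbitrary choice.
        sigma_set.append(tau + (0,) * (s - len(tau)))
    return sigma_set


def downward_closure(strings: Sequence[BitString]) -> Set[BitString]:
    """All initial segments of the given strings."""
    return {s[:k] for s in strings for k in range(len(s) + 1)}


# Hypothetical first five stages of an enumeration of a c.e. tree.
enumeration = [(), (0,), (1,), (0, 1), (1, 1)]
sigma_set = build_sigma(enumeration)
print(sigma_set)
```

On this fragment the constructed set contains exactly one string of each length
from 1 to 5, and every enumerated string lies in its downward closure, matching
the inclusion of $T$ in $T_\Sigma$ claimed in the proof.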

We now examine the strength of $\mathsf{RKL}^{(1)}$.

Theorem 13 ($\mathsf{RCA}_0$). $\mathsf{RT}^2_2$ implies $\mathsf{RKL}^{(1)}$.

Proof. Fix an infinite set of strings $\Sigma$. For each $y$, let $l \ge y$ be the
length of the shortest string in $\Sigma$ of length at least $y$. Let $\sigma_y$ be
the lexicographically least string in $\Sigma$ of length $l$. We now define a
coloring $f : [\mathbb{N}]^2 \to \{0,1\}$ as before. For each
$x < y \in \mathbb{N}$, set $f(x,y) = \sigma_y(x)$. Note that $f$ is
$\Delta^0_1$-definable.

By $\mathsf{RT}^2_2$, there is an infinite set $H$ homogeneous for $f$ with color
$c \in \{0,1\}$. We claim that $H$ is homogeneous for a path through $T_\Sigma$.

Fix $t \in \mathbb{N}$. Because $H$ is infinite, there is some $y \in H$ with
$y > t$. By definition of $f$, $(\forall x < y)[f(x,y) = \sigma_y(x)]$. Because $H$
is homogeneous for $f$ with color $c$ and because $y \in H$,
$(\forall x < y)[x \in H \Rightarrow \sigma_y(x) = c]$. In other words, $H$ is
homogeneous for $\sigma_y \upharpoonright y$ with color $c$. Because
$\sigma_y \upharpoonright y \in T_\Sigma$ and because $y > t$ with $t$ arbitrary,
$H$ is homogeneous for a path through $T_\Sigma$ (in the sense of definition 1). □

Remark 14. The coloring defined in the proof of theorem 13 is not (necessarily)
stable because it is defined in terms of $\Sigma$, and not in terms of $T$. For
example, suppose that $\sigma(0) = 0$ for even length strings $\sigma \in \Sigma$,
and that $\sigma(0) = 1$ for odd length strings. Then $\lim_y f(0, y)$ does not
exist.
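
The coloring from the proof of theorem 13, and the failure of stability described
in remark 14, can be sketched in Python on a finite set of strings. The helper
names and the particular strings (first bit equal to the parity of the length,
all other bits zero) are illustrative assumptions of ours.

```python
from typing import List, Tuple

BitString = Tuple[int, ...]


def sigma_for(y: int, strings: List[BitString]) -> BitString:
    """Lexicographically least string of the shortest available length l >= y."""
    candidates = [s for s in strings if len(s) >= y]
    if not candidates:
        raise ValueError("no string of length at least y in this finite sample")
    l = min(len(s) for s in candidates)
    # Tuples of equal length compare lexicographically.
    return min(s for s in candidates if len(s) == l)


def f(x: int, y: int, strings: List[BitString]) -> int:
    """The coloring f(x, y) = sigma_y(x), defined for x < y."""
    if not x < y:
        raise ValueError("f is only defined for x < y")
    return sigma_for(y, strings)[x]


# One string of each length 1..8: first bit 0 for even length, 1 for odd length.
strings = [(n % 2,) + (0,) * (n - 1) for n in range(1, 9)]

# The color of (0, y) alternates, so it has no limit as y grows.
print([f(0, y, strings) for y in range(1, 9)])
```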

There is a natural correspondence between computable colorings
$f : [\mathbb{N}]^2 \to \{0,1\}$ and computable sets
$\Sigma \subseteq 2^{<\mathbb{N}}$ that contain exactly one string of each length.
Given $f$, simply define
$\Sigma = \{\sigma_y : \sigma_y \in 2^y \wedge (\forall x < y)[\sigma_y(x) = f(x,y)]\}$.
Using the induced tree $T_\Sigma$, it is not difficult to show that
$\mathsf{RKL}^{(1)}$ implies $\mathsf{SRT}^2_2$ over
$\mathsf{RCA}_0 + \mathsf{B}\Sigma^0_2$. Yokoyama was able to eliminate the use of
$\mathsf{B}\Sigma^0_2$ by introducing the following principle.

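Statement 15 ($\mathsf{RCA}_0$). $\mathsf{P}^2_2$ asserts that "for any
$\Pi^0_2$-definable subsets $A_0, A_1$ of $\mathbb{N}$ s.t.
$A_0 \cup A_1 = \mathbb{N}$, there exists an infinite set
$H \in S(\mathcal{M})$ s.t. $H \subseteq A_0$ or $H \subseteq A_1$."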

The principle $\mathsf{P}^2_2$ is particularly useful because it implies the better
understood principle $\mathsf{D}^2_2$. This is a special instance of the principle
$\mathsf{D}^n_2$, which we will return to in the next section.

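Statement 16 ($\mathsf{RCA}_0$). For each $n \in \omega$, $\mathsf{D}^n_2$ asserts
that "for each $\Delta^0_n$-definable subset $A$ of $\mathbb{N}$, there exists an
infinite set $H \in S(\mathcal{M})$ s.t. $H \subseteq A$ or
$H \subseteq \overline{A}$."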



Theorem 17 (Cholak, Chong, Jockusch, Lempp, Slaman, Yang [1, 4]). Over
$\mathsf{RCA}_0$, $\mathsf{D}^2_2$ is equivalent to $\mathsf{SRT}^2_2$.

Theorem 18 (Yokoyama [15]). $\mathsf{RKL}^{(1)}$ implies $\mathsf{P}^2_2$, and hence $\mathsf{SRT}^2_2$, over $\mathsf{RCA}_0$.

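Proof. Let $\mathcal{M} = (\mathbb{N}, S(\mathcal{M})) \models \mathsf{RCA}_0 + \mathsf{RKL}^{(1)}$
and suppose that $A_0, A_1$ are $\Pi^0_2$-definable subsets of $\mathbb{N}$ s.t.
$A_0 \cup A_1 = \mathbb{N}$. We will define a $\Delta^0_1$ function
$f : [\mathbb{N}]^2 \to \{0,1\}$ s.t. if $f(x,y) = i$ for infinitely many $y$, then
$x \in A_i$.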

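Fix two $\Sigma^0_0$ formulas $\theta_i(x,m,n)$ s.t.
$x \in A_i \Leftrightarrow (\forall m)(\exists n)\theta_i(x,m,n)$. Using
these formulas, we define a helper function:

$h(x,y) = (\mu z)\big[(\forall m < y)(\exists n < z)[\theta_0(x,m,n)] \vee (\forall m < y)(\exists n < z)[\theta_1(x,m,n)]\big]$.

Clearly, $h$ is a $\Delta^0_1$ function.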

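We must verify in $\mathsf{RCA}_0$ that $h$ is total. Fix $x \in \mathbb{N}$
arbitrary. Then $x \in \mathbb{N} = A_0 \cup A_1$, so $x \in A_0$ or $x \in A_1$.
So $(\forall m)(\exists n)\theta_0(x,m,n)$ or
$(\forall m)(\exists n)\theta_1(x,m,n)$. Let $y \in \mathbb{N}$ be arbitrary.
Suppose $x \in A_i$. Then $(\forall m)(\exists n)\theta_i(x,m,n)$, so clearly
$(\forall m < y)(\exists n)\theta_i(x,m,n)$. By $\mathsf{B}\Sigma^0_1$, there is a
$z_i$ s.t. $(\forall m < y)(\exists n < z_i)\theta_i(x,m,n)$. Thus, $h$ will find a
least $z$ s.t. the desired condition holds.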

Define $f(x,y) = 0$ if $(\forall m < y)(\exists n < h(x,y))[\theta_0(x,m,n)]$, and
$f(x,y) = 1$ otherwise. Clearly, $f$ is total since $h$ is total, and $f$ is
$\Delta^0_1$ since $h$ is total and $\Delta^0_1$. If $f(x,y) = i$ for infinitely
many $y$, then our definition of $h(x,y)$ confirms that $x \in A_i$.

Using $f$, let
$\Sigma = \{\sigma_y : \sigma_y \in 2^y \wedge (\forall x < y)[\sigma_y(x) = f(x,y)]\}$
and define $T_\Sigma$ as before. Take an infinite $H$ homogeneous for a path
through $T_\Sigma$ with some color $c \in \{0,1\}$.

Let $x \in H$ be arbitrary. By definition of "homogeneous for a path through
$T_\Sigma$ with color $c$," there are infinitely many $y$ s.t.
$f(x,y) = \sigma_y(x) = c$. By our choice of $f$, this means that $x \in A_c$. In
other words, $H \subseteq A_c$ is the desired infinite set. □
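
The unbounded search defining $h$ and the resulting coloring $f$ can be sketched
in Python for a toy pair of sets. Here $A_0$ is the set of even numbers and $A_1$
the set of odd numbers, presented through a deliberately artificial bounded
formula `theta`; these choices and all function names are illustrative
assumptions of ours. The search in `h` terminates only because the two sets
together cover every number.

```python
from itertools import count


def theta(i: int, x: int, m: int, n: int) -> bool:
    """Toy bounded formula: 'x has parity i and n >= m'.
    Then x is in A_i iff for all m there is an n with theta(i, x, m, n)."""
    return x % 2 == i and n >= m


def bounded_witnesses(i: int, x: int, y: int, z: int) -> bool:
    """(forall m < y)(exists n < z) theta_i(x, m, n)."""
    return all(any(theta(i, x, m, n) for n in range(z)) for m in range(y))


def h(x: int, y: int) -> int:
    """Least z such that the bounded condition holds for theta_0 or theta_1."""
    for z in count(0):
        if bounded_witnesses(0, x, y, z) or bounded_witnesses(1, x, y, z):
            return z


def f(x: int, y: int) -> int:
    """0 if theta_0 is witnessed below h(x, y) for every m < y, else 1."""
    return 0 if bounded_witnesses(0, x, y, h(x, y)) else 1


print([f(x, 3) for x in range(6)])
```

In this toy case $f(x,y)$ simply reports the parity of $x$ once $y > 0$, so the
color taken infinitely often identifies which of the two sets contains $x$.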

Question 19 (Yokoyama [15]). Does $\mathsf{D}^2_2$ imply $\mathsf{P}^2_2$? Does $\mathsf{P}^2_2$ imply $\mathsf{RKL}^{(1)}$?

Corollary 20. $\mathsf{RKL}^{(1)}$ is incomparable with $\mathsf{WKL}_0$ over $\mathsf{RCA}_0$.

Proof. Because $\mathsf{WKL}_0$ does not imply $\mathsf{SRT}^2_2$ (Theorem 3.3 of
[12] and Theorem 10.5 of [1]), $\mathsf{WKL}_0$ does not imply
$\mathsf{RKL}^{(1)}$. By the main result of [11], $\mathsf{RT}^2_2$ does not imply
$\mathsf{WKL}_0$, so $\mathsf{RKL}^{(1)}$ cannot imply $\mathsf{WKL}_0$. □

Remark 21. Using the above arguments, we can rephrase $\mathsf{RT}^2_2$ as the
statement "for each $\Sigma$ which contains exactly one string of each length,
there is an infinite $H$ which is homogeneous (with fixed color $c$) for each
$\sigma \in \Sigma$ s.t. $|\sigma| \in H$."
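
The correspondence between colorings of pairs and sets containing one string of
each length, which underlies this rephrasing, can be sketched in Python. The
sample coloring (parity of $x + y$) and the function names are illustrative
assumptions of ours, and only finitely many strings are built.

```python
from typing import Callable, List, Set, Tuple

BitString = Tuple[int, ...]


def strings_from_coloring(f: Callable[[int, int], int], n: int) -> List[BitString]:
    """For each y < n, the string sigma_y of length y with sigma_y(x) = f(x, y)."""
    return [tuple(f(x, y) for x in range(y)) for y in range(n)]


def homogeneous_for_coloring(H: Set[int], f: Callable[[int, int], int], c: int) -> bool:
    """Every pair x < y from H receives color c."""
    return all(f(x, y) == c for x in H for y in H if x < y)


def homogeneous_for_selected_strings(H: Set[int], strings: List[BitString], c: int) -> bool:
    """H is homogeneous with color c for each sigma_y with y in H.
    Assumes strings[y] has length y and max(H) < len(strings)."""
    return all(strings[y][x] == c for y in H for x in H if x < y)


def coloring(x: int, y: int) -> int:
    """Hypothetical coloring: the parity of x + y."""
    return (x + y) % 2


strings = strings_from_coloring(coloring, 7)
H = {0, 2, 4, 6}
print(homogeneous_for_coloring(H, coloring, 0),
      homogeneous_for_selected_strings(H, strings, 0))
```

The two checks agree on every finite set, since each simply restates the other
through the equation $\sigma_y(x) = f(x,y)$.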

Question 22. Does $\mathsf{RKL}^{(1)}$ imply COH, CAC, ADS or $\mathsf{RT}^2_2$? One
implication holds if and only if all implications hold. Does $\mathsf{SRT}^2_2$
imply $\mathsf{RKL}^{(1)}$?

4. Arithmetically-definable trees 

Statement 23 ($\mathsf{RCA}_0$). $\mathsf{RKL}^{(<\omega)}$ is the axiom scheme
which, for each arithmetic formula $\varphi$, asserts that "if $\varphi$ defines a
tree $T$ containing arbitrarily long strings, there is an infinite set $H$ which is
homogeneous for a path through $T$."

Theorem 24. Over $\mathsf{RCA}_0$, we have the following strict implications:
$\mathsf{ACA}_0 \Rightarrow \mathsf{RKL}^{(<\omega)} \Rightarrow \mathsf{RKL}^{(1)} \Rightarrow \mathsf{RKL}$.

The implications are immediate. We have already seen that the third implication
is strict. We now show that the first two implications are also strict. We first use
the following result from [11] to separate $\mathsf{RKL}^{(<\omega)}$ from $\mathsf{ACA}_0$.

Theorem 25 (Liu, [11]). For every $C \not\gg 0$ (that is, every $C$ not of PA
degree) and every coloring $p : \mathbb{N} \to \{0,1\}$, there exists an infinite
set $H$ homogeneous for $p$ such that $H \oplus C \not\gg 0$.

Corollary 26. There is an $\omega$-model of $\mathsf{RKL}^{(<\omega)}$ where $\mathsf{WKL}_0$ fails.

Proof. To build an $\omega$-model $\mathcal{M} = (\omega, S(\mathcal{M}))$ of
$\mathsf{RKL}^{(<\omega)}$, we begin with $S(\mathcal{M}) = \mathrm{REC}$ and add
sets to $S(\mathcal{M})$.

The general strategy for creating a model of $\mathsf{RKL}^{(<\omega)}$ uses a list
of the infinite trees which are arithmetically-definable from any set
$X \in S(\mathcal{M})$. For each $i \in \mathbb{N}$, we must ensure that there is
some finite stage $s$ where we select a path $p$ through the tree $T_i$, where we
select an infinite set $H_s$ homogeneous for $p$, and where we add $H_s$ to
$S(\mathcal{M})$ and close downward under $\le_T$. To ensure that
$\mathcal{M} \not\models \mathsf{WKL}_0$, we use Theorem 25 to select $H_s$ s.t.
$H_s \oplus \bigoplus_{j \le s-1} H_j \not\gg 0$.

It is possible that adding the set $H_s$ to $S(\mathcal{M})$ causes new sets to
become arithmetically-definable with parameters from $S(\mathcal{M})$. Therefore,
each time we add $H_s$ to $S(\mathcal{M})$, we create a new list containing the
trees arithmetically-definable from $\bigoplus_{i \le s} H_i$. We dovetail the
lists, eventually running the general strategy for each tree in each list. In the
limit, we obtain $\mathcal{M} \models \mathsf{RKL}^{(<\omega)} + \neg\mathsf{WKL}_0$. □

Corollary 27 ($\mathsf{RCA}_0$). $\mathsf{RKL}^{(<\omega)}$ does not imply $\mathsf{WKL}_0$.

We separate $\mathsf{RKL}^{(<\omega)}$ from $\mathsf{RKL}^{(1)}$ with an $\omega$-model by the following observation.

Lemma 28. For each $n$, no model of $\mathsf{RKL}^{(<\omega)}$ is bounded by $0^{(n)}$.

Proof. By the proof of lemma 7 relativized to $X = 0^{(n)}$, we obtain a
$0^{(n)}$-computable infinite tree $T$ s.t. no infinite set $W_e^{0^{(n)}}$ is
homogeneous for a path through $T$. Since each $0^{(n)}$-computable set is
$W_e^{0^{(n)}}$ for some $e$, it follows that no infinite $0^{(n)}$-computable
set is homogeneous for a path through $T$. □

Proposition 29. $\mathsf{RT}^2_2$ does not imply $\mathsf{RKL}^{(<\omega)}$ over
$\mathsf{RCA}_0$. Consequently, $\mathsf{RKL}^{(1)}$ does not imply
$\mathsf{RKL}^{(<\omega)}$ over $\mathsf{RCA}_0$.

Proof. By the previous lemma, there is no model of $\mathsf{RKL}^{(<\omega)}$ which
is bounded by $0^{(2)}$. By Theorem 3.1 of [1] there is an $\omega$-model of
$\mathsf{RT}^2_2$ consisting of only low$_2$ sets. This model is bounded by
$0^{(2)}$, so it is not a model of $\mathsf{RKL}^{(<\omega)}$. □

Question 30. Does $\mathsf{RKL}^{(<\omega)}$ imply COH over $\mathsf{RCA}_0$?
Equivalently, does $\mathsf{RKL}^{(<\omega)}$ imply $\mathsf{RT}^2_2$ over
$\mathsf{RCA}_0$?

4.1. Subsets, co-subsets, and trees. There is a close relationship between find- 
ing subsets/cosubsets of a fixed set, and finding sets that are homogeneous for a 
path through a fixed tree. 

Statement 31 ($\mathsf{RCA}_0$). We define $\mathsf{D}^{<\omega}_2$ to be the axiom
scheme which asserts $\mathsf{D}^n_2$ for each $n \in \omega$.

Proposition 32 ($\mathsf{RCA}_0$). $\mathsf{RKL}^{(<\omega)}$ implies $\mathsf{D}^{<\omega}_2$.

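Proof. Let $\mathcal{M} = (\mathbb{N}, S(\mathcal{M})) \models \mathsf{RCA}_0 + \mathsf{RKL}^{(<\omega)}$
and suppose that $A$ is a $\Delta^0_n$-definable subset of $\mathbb{N}$. We give a
$\Pi^0_n$ definition for a tree $T$ as follows. Given
$\tau \in 2^{<\mathbb{N}}$, we say that $\tau \in T$ if and only if
$(\forall x < |\tau|)[\tau(x) = 1$ if and only if $x \in A]$.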

By $\mathsf{RKL}^{(<\omega)}$, there is an infinite set $H \in S(\mathcal{M})$
which is homogeneous for arbitrarily long strings in $T$ with color
$c \in \{0,1\}$. Note that the only strings in $T$ are initial segments of
$\chi_A$, so $H$ is homogeneous for $\chi_A$ with color $c$. Then
$H \subseteq A$ if $c = 1$, and $H \subseteq \overline{A}$ if $c = 0$, as
desired. □
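
The tree used in this proof can be sketched in Python for a decidable toy set.
The set `in_A` (multiples of 3) and the function names are illustrative
assumptions of ours; in the proposition itself $A$ is only $\Delta^0_n$-definable,
so membership need not be computable.

```python
from itertools import product
from typing import List, Tuple

BitString = Tuple[int, ...]


def in_A(x: int) -> bool:
    """Hypothetical set A: the multiples of 3."""
    return x % 3 == 0


def in_tree(tau: BitString) -> bool:
    """tau is in T iff, for every x < |tau|, tau(x) = 1 exactly when x is in A."""
    return all((tau[x] == 1) == in_A(x) for x in range(len(tau)))


def strings_in_tree(length: int) -> List[BitString]:
    """All strings of the given length that belong to T (brute force)."""
    return [t for t in product((0, 1), repeat=length) if in_tree(t)]


# Only the initial segment of the characteristic function of A survives.
print(strings_in_tree(4))
```

Since the tree contains exactly one string of each length, a set homogeneous for
its strings with color 1 can only contain elements of $A$, and with color 0 only
elements of the complement.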

Remark 33. For $\omega$-models, the reverse implication also holds.

Question 34. Does $\mathsf{D}^{<\omega}_2$ imply $\mathsf{RKL}^{(<\omega)}$ over $\mathsf{RCA}_0$?

By results of [1], $\mathsf{SRT}^2_2$ implies $\mathsf{B}\Sigma^0_2$.

Question 35. Are there first order consequences of $\mathsf{RKL}^{(<\omega)}$ beyond $\mathsf{B}\Sigma^0_2$?

Chong, Slaman, and Yang have recently announced a proof that $\mathsf{D}^2_2$ does
not imply COH over $\mathsf{RCA}_0$.

Question 36. Does $\mathsf{D}^n_2$ imply COH for any $n \in \omega$?

Theorem 2.1 of [9] gives another way to state this question for $\omega$-models.

Question 37. Is there any arithmetically-definable $f : \mathbb{N} \to \{0,1\}$
such that any set $H$ homogeneous for $f$ satisfies $H' \gg 0'$?

Figure 1. A summary of the principles considered.